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Abstract 

At present, applications of biometrics are rapidly in- 
creasing due to inconveniences in using traditional pass- 
words and physical keys. Hand geometry, one of the most 
well-known biometrics, is implemented in many verifica- 
tion systems with various feature extraction methods. In re- 
cent work, a hand geometry verification system using time 
series conversion techniques and Dynamic Time Warping 
(DTW) distance measure with Sakoe-Chiba band has been 
proposed. This system demonstrates many advantages, es- 
pecially ease of implementation and small storage space re- 
quirement using time series representation. In this paper, 
we propose a novel hand geometry verification system that 
exploits DTW distance measure and R-K band learning to 
further improve the system performance. Finally, our evalu- 
ation reveals that our proposed system outperforms the cur- 
rent system by a wide margin, in terms of False Acceptance 
Rate (FAR), False Rejection Rate (FRR), and Total Success 
Rate (TSR) at Equal Error Rate (EER). 



1. Introduction 

Nowadays, biometrics is gradually used in place of tradi- 
tional passwords and physical keys simply because the pass- 
words could be forgotten and the physical keys could be lost 
or stolen. On the other hand, many parts of human organs or 
characteristics such as a fingerprint, hand geometry, an iris, 
voice, a face profile, and a signature, are unique enough to 
be used for person identification purposes. Specifically, Ta- 
ble Q] shows a comparison among various biometrics based 
on seven attributes ifTTl [T3l , i.e., universality, uniqueness, 
permanence, collectability, performance, acceptability, and 
circumvention. 

A biometric authentication system can be categorized 
into two main functionalities 1 13 ] - identification and ver- 
ification. For identification, a system receives an image 
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Figure 1 . Images acquired from (a) a peg-fixed 
system and (b) a peg-free system. 



from an input sensor, extracts features, and queries from 
a database for the best match. If the input image is too dif- 
ferent from the best retrieved template, the system rejects. 
Otherwise, it accepts and identifies the input's identity. Un- 
like identification, a verification system requires a user iden- 
tity as an additional input. Instead of querying from all the 
templates in the database, the verification system queries 
on only the claimed user's own templates. Matching only 
on much smaller set of templates generally makes the veri- 
fication system much more accurate than the identification 
system. 

At present, many hand geometry authentication systems 
are implemented both in identification and verification tasks 
due to the ease of sample collection, relatively low hardware 
costs, and availability of various feature extraction algo- 
rithms. Specifically, two system environments are designed, 
shown in Figure [T] i.e., a peg-fixed system l24l [T5l l26l [29l 
and a peg-free system EllSISISlESIISIDEIllMIElia. 
The peg-fixed system uses pins attached on a plate to align 
hand's position properly. On the other hand, in the peg-free 
system, no pin is needed. To transform an input image to 
computable numerical features, many extraction method- 
ologies have been proposed |T5l l28l HI H2l l25l ITSl HOl. 
Unfortunately, these extracted features techniques are quite 



Table 1. Various biometric comparison based on seven attributes [13] 



Biometrics 


Universality 


Uniqueness 


Permanence 


Collectability 


Performance 


Acceptability 


Circumvention 


Face 


High 


Low 


Medium 


High 


Low 


High 


Low 


Fingerprint 


Medium 


High 


High 


Medium 


High 


Medium 


High 


Hand geometry 


Medium 


Medium 


Medium 


High 


Medium 


Medium 


Medium 


Keystroke 


Low 


Low 


Low 


Medium 


Low 


Medium 


Medium 


Iris 


High 


High 


High 


Medium 


High 


Low 


High 


Signature 


Low 


Low 


Low 


High 


Low 


Medium 


Low 


Voice 


Medium 


Low 


Low 


Medium 


Low 


Medium 


Low 



computationally expensive and the system may fail to ex- 
tract features in some of the hand configurations. 

Recently, a hand geometry verification system using a 
time series representation ifTTl has been proposed. Some of 
its advantages include small storage space requirement and 
ease of implementation. To compare similarity between two 
time series data, Dynamic Time Warping distance measure 
with Sakoe-Chiba band l23l is used. 

In this paper, we extend this previous work further. In- 
stead of using Sakoe-Chiba band, R-K bands l2T1l are used 
and are assigned to each user's templates with calculated 
threshold values. This novel hand geometry verification 
system significantly increases overall system performances, 
especially, reduction in False Acceptance Rate (FAR) and 
False Rejection Rate (FRR) at Equal Error Rate (EER). 

The rest of the paper is organized as follows. Our pro- 
posed hand geometry verification system is described in 
Section 2. Section 3 discusses our evaluation method and 
shows the experimental results. Finally, in Section 4, we 
conclude our work and provide some suggestions. 
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Figure 2. Overviews of (a) a typical biomet- 
ric verification system and (b) our proposed 
hand geometry verification system. 



2. Hand Geometry Verification System 



Typical verification system usually consists of five ma- 
jor components fT3lh i.e., a sensor, a feature extractor, a 
matcher, a stored template, and an application device with 
two functions - enrollment and testing, as shown in Figure 
[2(a). First, data are acquired from a sensor, and then the in- 
put data are sent to a feature extractor to transform the data 
to numerical features. If the system is in an enrollment step, 
input features (from the feature extractor) will be added to a 
database as a stored template labeled with a user's identity. 
During the test phase, a matcher retrieves the stored tem- 
plates corresponding only to the claimed username, and a 
distance measure is used to calculate similarity between the 
input features and these retrieved templates. If the distance 
is less than a pre-defined threshold, the system accepts; oth- 
erwise, it rejects. 

Our proposed system, extended from the typical verifi- 
cation system, consists of six components - a hand scanner, 
a feature extractor, R-K band and user's individual thresh- 



old learning, stored templates with R-K band and user's in- 
dividual threshold, a matcher, and an application device. A 
scanner first acquires an input hand image, and then time se- 
ries data is extracted from the input image using the feature 
extractor; a centroid-based [ITU or an angle-based Q tech- 
nique is used for the time series conversion. After extraction 
has finished and the system has been in the enrollment step, 
an R-K band and user's individual threshold is learned and 
stored. In the testing step, user's templates with the learned 
R-K band and user's threshold are retrieved. The matcher 
uses DTW distance measure with this learned R-K band to 
find the best match within this stored set of templates. Fi- 
nally, the measured distance is compared to that particular 
user's threshold multiplied by a system- wide threshold to 
grant or deny access to an application device. An overview 
of our proposed system is shown in Figure [2] (b). 
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Figure 3. Steps of converting a raw hand im- 
age to time series data. 



2.1. Feature Extractor 

The feature extractor ifTTl (shown in Figure [3]) converts 
acquired images to time series according to the following 
steps, i.e., brightness and contrast adjustment, binarization, 
edge extraction, and time series conversion. 

Brightness and Contrast Adjustment. After an origi- 
nal color raw image is obtained, it will be transformed into 
a grayscaled image. Then, the system will adjust its bright- 
ness and contrast to the quality that is suitable for the bina- 
rization step. 

Binarization. After brightness and contrast adjustment, 
some image pixels may not be pure black or white. So, we 
binarize each pixel into solid black or white, represented 
by '0' and T, respectively. The binarization function is 
defined as follows 



B(x,y) 



if Ixy > t 

otherwise 



(1) 



where I xy is the intensity ranging from to 1 at pixel (x,y), 
and t is the specified threshold for binarization. The default 
threshold value for the grayscale image is simply set to 0.5. 

Edge Extraction. The goal in this step is to find an edge 
sequence from a binarized hand image by using bound- 
ary extraction algorithm [7 ]. The algorithm precisely starts 
scanning of each pixel from the top left of the image to the 
bottom right of the image. Once it finds the first black pixel, 
it stops scanning, and then traces along the edge in a clock- 
wise direction until it returns to the starting pixel. 

Time Series Conversion. In this step, we calculate the 
edge sequence, and transform hand's shape into time series 
data by using two techniques, i.e., an angle-based technique 
1 7 ] and a centroid-based technique lfT4l . In the angle-based 
technique, for each pixel index i, we create two tangent 
lines - forward and backward tangents. The forward line 
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Figure 4. General ideas of (a) an angle-based 
conversion technique and (b) a centroid- 
based conversion technique. 



is created by drawing a straight line from the pixel index i 
to the pixel index i + S, and the backward line is created 
by drawing a straight line from the pixel index i to the pixel 
index i — S. Note that the S value depends on the size of 
the image; the larger the S value, the smoother the time se- 
ries, and vice versa. In our experiment, the default value 
of S is set to 10. After that we record the angle formed by 
these two lines as time series' amplitude, as shown in Fig- 
ure H] (a). In the centroid-based technique, we first locate 
the hand's centroid. Once the centroid is obtained, we sim- 
ply plot the Euclidean distance from each pixel position to 
the centroid position. The general idea of centroid-based 
conversion is shown in Figure Sl(b). 

2.2. Dynamic Time Warping (DTW) Distance Mea- 
sure 

Dynamic Time Warping (DTW) J2| |22) distance mea- 
sure is a well-known similarity measure based on shape. It 
uses a dynamic programming technique to find all possible 
warping paths, and selects the one with the minimum dis- 
tance between two time series. To calculate the distance, 
it first creates a distance matrix, where each element in the 
matrix is a cumulative distance of the minimum of three 
surrounding neighbors. Suppose we have two time series, 
a sequence Q = (gi, ^2? ? Qn) an d a sequence 




Figure 5. DTW without using global constraint 
may introduce an unwanted warping. 



However, in many cases, this is probably not what we in- 
tend, where the two time series are expected to be of differ- 
ent classes. We can resolve this problem by limiting permis- 
sible warping paths using a global constraint. A well-known 
global constraints, Sakoe-Chiba band (S-C band) l23l and 
a recent representation, Ratanamahatana-Keogh band (R-K 
band) |21 ], have been proposed. Figure[6]shows an example 
for each type of the constraints. 

2.3. R-K Band Learning 




Figure 6. Global constraint examples of (a) 
Sakoe-Chiba band and (b) Ratanamahatana- 
Keogh band. 



C = (ci, C2, . . . , Cj, . . . , c m ). First, we create an n-by- 
m matrix where every (i, j) element of the matrix is the 
cumulative distance of the distance at (i, j) and the mini- 
mum of three neighboring elements, where 1 < i < n and 
1 < j < m. We can define the (i, j) element, 7^ , of the 
matrix as: 



7i,j = dij + mm {7i-i,i-i 5 7i-i,i ? 7i,i-i} 



(2) 



where = (q — qj) 2 is the squared distance of qi and 
Cj, and jij is the summation of and the minimum cu- 
mulative distance of three elements surrounding the (i, j) 
element. Then, to find an optimal path, we choose the path 
that yields a minimum cumulative distance at (n, m), which 
is defined as: 



DTW(Q,C) 



mm 



K 



k=l 



(3) 



where W is a set of all possible warping paths, is (i, j) 
at k th element of a warping path, and K is the length of the 
warping path. 

In reality, DTW may not give the best mapping accord- 
ing to our need because it will try its best to find the min- 
imum distance though it may generate an unwanted path. 
For example, in Figure[5l without a global constraint, DTW 
will find its optimal mapping between two time series. 



Ratanamahatana-Keogh band (R-K band) L21J is a gen- 
eral model of a global constraint specified by a one- 
dimensional array R, i.e., R = (n, r2, . . . , n, . . . , r n ) 
where n is the length of time series, and T{ is the height 
above the diagonal in y direction and the width to the right 
of the diagonal in x direction. Each ri value is arbitrary, 
therefore R-K band is also an arbitrary- shape global con- 
straint, as shown in Figure 0(b). Note that when Ti = 0, 
where 1 < i < n, this R-K band represents the Euclidean 
distance, and when = n, 1 < i < n, this R-K band rep- 
resents the classic DTW distance with no global constraint. 
The R-K band is also able to represent the S-C band by giv- 
ing all ri = c, where c is the width of a global constraint. 
Moreover, the R-K band is a multi band model which can 
effectively be used to represent one band for each class of 
the data. This flexibility is a great advantage; however, the 
higher the number of classes, the larger the time complexity, 
as we have to search through such a large space. 

Since determining the optimal R-K band for each train- 
ing set is quite computationally intensive, a hill climbing 
and heuristic functions have been introduced to guide which 
part of space should be evaluated. A space is defined as 
a segment of a band to be increased or decreased. In the 
original work, two heuristic functions, accuracy metric and 
distance metric, are used to evaluate a state. The accuracy 
metric is evaluated from the training accuracy using leave- 
one-out 1-NN, and the distance metric is a ratio of the mean 
DTW distances of correctly classified and incorrectly clas- 
sified objects. 

Two searching directions are considered, i.e., a forward 
search, and a backward search. In a forward search, we 
start from the Euclidean distance (all values in R equal 
to 0), and parts of the band are gradually increased in each 
searching step. In the case where two bands have the same 
heuristic value, a wider band is selected. On the other hand, 
in a backward search, we start from a very large band (all 
ri values in R equal to n, where n is the length of the time 
series), and parts of the band are gradually decreased in each 
searching step. If two bands have the same heuristic value, 
the tighter band is chosen. 

Our learning algorithm first starts from enqueuing the 
starting- and ending-parts of the R-K Band. In each itera- 
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Figure 7. An illustration of the concept in R-K 
band forward searching algorithm. [21 j 




Figure 8. Examples of input hand images from 
four different people. 



Table 2. The pseudo code for multiple R-K 
bands learning. 

Function [band] = Learning[T, threshold] 
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11 
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7V= size of T; 
L= length of data in T; 
initialize frand^ for i = 1 to c; 
foreachclass i = 1 to c 

enqueue(l, L, Queued; 
endfor 

best_evaluate = evaluate(T, band); 
while !empty(Q^e^e) 
foreachclass z = 1 to c 
if !empty(Q^e^ei) 
[start, end] = dequeue(Q^e^ei) 
adjustable = adjust(kmdz, start, end); 
if adjustable 
evaluate= evaluate(T, band); 
if evaluate > best_evaluate 
best_evaluate = evaluate; 
enqueue(star£, end, Queue i); 
else 

undo_adjustment(6and^, start, end); 
if (start - end) I 2 > threshold 
enqueue(s£art, mid-1, Queued; 
enqueue(?77id, end, Queued; 
endif 
endif 
endif 
endif 
endfor 
endwhile 



tion, these values are dequeued and used as a boundary for a 
band expansion/reduction. The adjusted band is then eval- 
uated. If a heuristic value is higher than the current best 
heuristic value, the same start and end values are enqueued. 
If not, this part is further divided into two equal subparts be- 
fore being enqueued, as shown in Figure |7l The iterations 
are continued until a termination condition is met. Table [2] 
shows the pseudo code for this multiple R-K bands learning. 

3. Experimental Evaluation 

The dataset is collected from 21 different persons, 6-7 
images per person, in a total of 128 images, using a color 
scanner with 1200x1200 pixel resolution. Examples of in- 
put hand images are shown in Figured 

After that all images are converted into time series data 
by applying two time series conversion techniques - a 
centroid-based conversion technique and an angle-based 
conversion technique, and then time series data are down- 
sampled to 50 data points. Examples of converted time se- 
ries data using the centroid-based technique and the angle- 
based technique are shown in Figure [9] (a) and (b), respec- 
tively. 

To evaluate the system's performance, we use three well- 
known measurements, i.e., False Rejection Rate (FRR), 
False Acceptance Rate (FAR), and Total Success Rate 
(TSR). They can be calculated by the following equations. 



FRR ^^ e 3 ec ^enuineClaims ^qq^ 
Total^Genuine Access 



(4) 



FAR = #AcceptImposterClaims 
Total# I mposter Access 
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Figure 9. Time series data examples after the 
conversion using (a) the centroid-based tech- 
nique and (b) the angle-based technique. 





Previous approach 




Centroid-based technique 


Angle-based technique 


FAR 


14.06% 


25.59% 


FRR 


14.30% 


25.78% 


TSR 


85.71% 


72.99% 










Proposed method 




Centroid-based techniques 


Angle-based techniques 


FAR 


11.72% 


23.67% 


FRR 


11.91% 


23.43% 


TSR 


88.10% 


76.33% 



Table 3. The comparison of FAR, FRR, and 
TSR among different approaches at EER. 



TSR- 



1 



FAR + FRR \ 
Totalj^ Access J 



• 100% 



(6) 



To get the best performance, two parameters are varied 
- a systemwide (global) threshold and a conversion tech- 
nique. An experimental result at the Equal Error Rate (EER) 
point, where FAR equals to FRR, is shown in Table [3] 
Note that the lower the EER value, the higher the accuracy 
of the biometric system. To further illustrate our overall 
performance, we also show their ROC (Receive Operating 
Characteristic) curves (shown in Figure ITOb. At the EER, 
our proposed method gets better performance (lower FAR, 
lower FRR, and higher TSR) in all evaluation metrics. 

4. Conclusion 

We have demonstrated our novel hand geometry verifi- 
cation system by using time series data as an image repre- 
sentation and comparing time series using Dynamic Time 
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Figure 10. The ROC curves of our proposed 
system comparing with the previous one us- 
ing (a) the centroid-based conversion tech- 
nique and (b) the angle-based conversion 
technique. 



Warping distance measure with the R-K band. In the pro- 
posed system, two time series conversion techniques are 
applied, i.e., the centroid-based conversion technique and 
the angle-based technique. Our experiment reveals that the 
centroid-based technique generally outperforms the angle- 
based technique by achieving lower EER and higher TSR. 
Finally, by comparing the result with the previous approach, 
our proposed system gets much higher performance. 

For future work, this hand geometry verification system 
can be extended to be a multi-biometric system. The fin- 
gerprint and palm print can be verified together in order to 
boost up the system's performance. 
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